Support for Seq2Seq Models (T5, T5Gemma, etc.) #3153

maxzuo · 2025-08-14T03:50:55Z

PR Description

Adds support for Seq2Seq models: AutoModelForSeq2SeqLM.

Why

Seq2Seq models are not directly supported, despite support for all model architectures. This is because FastModel.from_pretrained sets the auto_model parameter to either AutoModelForCausalLM or AutoModelForVision2Seq/AutoModelForImageTextToText.

Further, since models like T5 have class names ending in ForConditionalGeneration, unsloth registers this as a VLM and tries to load it as such.

I use AutoModelForSeq2SeqLM._model_mapping to check if a model config is registered as a Seq2Seq model. This logic can be extended to other auto models (e.g., AutoModelForSequenceClassification) if desired.

Links

Support for T5 has some community interest:

Resolves please give t5 support. #719
Resolves Support T5 models #643

Datta0 · 2025-08-18T11:36:24Z

Hey @maxzuo thanks for the contribution
It'd be of great help if you can possibly create a notebook showing fine-tuning of any small seq2seq model on google colab.
Also I notice this PR is marked draft. Are you intending to add more things to this?

maxzuo · 2025-08-18T15:42:53Z

@Datta0 sure I'm actively working on it, actually why I converted this to a draft. Will let you know!

Aman-byte1 · 2025-10-25T16:06:10Z

@maxzuo did u work on it?

maxzuo added 2 commits August 13, 2025 23:37

added support for seq2seq models

c618f0f

only trigger logic if auto_model is None (kept from original logic)

9ae79c1

This was referenced Aug 14, 2025

please give t5 support. #719

Open

Support T5 models #643

Open

maxzuo added 2 commits August 14, 2025 11:05

added support for peft loading of Seq2Seq LM

77c4f76

added missing import

765dd77

maxzuo marked this pull request as draft August 14, 2025 18:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Support for Seq2Seq Models (T5, T5Gemma, etc.) #3153

Support for Seq2Seq Models (T5, T5Gemma, etc.) #3153

maxzuo commented Aug 14, 2025

Uh oh!

Datta0 commented Aug 18, 2025

Uh oh!

maxzuo commented Aug 18, 2025

Uh oh!

Aman-byte1 commented Oct 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Support for Seq2Seq Models (T5, T5Gemma, etc.) #3153

Are you sure you want to change the base?

Support for Seq2Seq Models (T5, T5Gemma, etc.) #3153

Conversation

maxzuo commented Aug 14, 2025

PR Description

Why

Links

Uh oh!

Datta0 commented Aug 18, 2025

Uh oh!

maxzuo commented Aug 18, 2025

Uh oh!

Aman-byte1 commented Oct 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants